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RAW SEQUENCE LISTING DATE: 12/07/2000 

PATENT AP P LICAT 10 N : US/ 09/464 , 377 TIME: :i 0 : U 3 : 4 0 

Input Set : A:\726sl.txt 

OutpuL Sfct: N:\CRF3\12O7 2O0O\I4 64377.raw 

<;ilO> APPLXCANV: University of Southern Calllrorn.i a 
Stall cup, Michael R. 

Hong/ Heug 
Asward/ Dana W. 

<120> TITLE OF INVENTION: REGULATION OF GENE EXPRESSION BY PROTEIN 

METHYLATION 
<130> FILE REFERENCE: 13 761-726 

<140> CURRENT APPLICATION NUMBER: US 09/464,377 

<141> CURRENT FILING DATE: 1999-12-15 

<150> PRIOR APPLICATION NUMBER: US 60/112,52 3 

<151> PRTOK PILING DATE: 1998-12-15 

<160> NUMBER OF SEQ ID NOS ; 10 

<170> SOF'IVJARE: FastSEO for V-Zindows Version 4.0 
SEO ID NO: 1 
LENGTH: 3124 
<2;i2> TYPE: DNA 
<213> ORGANISM: Mus musculuc 
<4 0 0> SEQUENCE: :l 

agggggcotg gagccggacc taagatggca gcggcggcag cgacgcjcggt ggggccgggt: 6 0 

gcggqgagcg ctggggtggc gggcccgggc ggcgcqgggc cctgcgctac agtgtctgtg 120 
ttcccgggcg cccgcctcct cactatcggc gac^cgaacg gcgagatcca gcggcacgcg IBO 
gagcagcagg cgctgcgcct tgaggtgcgc gccggaccag acgcggcggg catcgccctc 24 0 

tacagccatg aagatgt:gtg tgttttcaag tgctcggtgt: cccgatjagac agagtgcagt 300 
cgtgtgggca gacagtcctt catcatcacc ctgggctgca acagcgtcct catccagttt 3G0 
gccacacccc acgatttctg ttctttctac aacatcctga aaacctgtcg gggccacaca 420 
otggagcgct ctgtgtt.cag tgagcggaca gaggaatcct cagctgtgca gtacttccag 480 
LtctatggcL acctatccca gcagoagaac atgatgcagg actatgtr.gcg gacaggcacc 540 
Laccagcgtg cgatcctgca gaaccacrjcg gacttcaagg acaagatcgt tctagatgtg 600 
ggctgtggct ctgggatcct gtcat;ttttt gctgctcaag oaggagccag gaaaatttat 650 
gcagtggaag ccagcaccat ggctcagcat gcagaggtcc tggtgaagag taacaatctg 7 20 

acagaccgca tcgtggtcat ccctggcaaa gtagaggagg tctcattgcc tgagcaagtg 780 
gacattatca tctcagagcc catgggctac atgctcttca atgaacgaat gctcgagagc 84 0 

tacctccatg ccaaaaagta cctgaagcct agtggaaaca tgttccccac ca ttggtgat 9 00 

gtccacctcg cacccttcac tgatgaacag ctctacatgg agcagttcac caaagcoaac 960 
ttccggtacc agccaUcctt ccatggagtg gaoctgtcgg ccctcagagg tgccgctgtg '1020 
gatgagtact tccggcaacc tgtggtggac acatttgaca tccggatcct gatggccaaa 1080 
tctgtcaagt acacagtgaa cUtottagaa gccaaagaag gcgatttgca caggatagaa 1140 
atcccattca aa ttccacat gctgcattca gggctagtcc atggcttggc cttctggttc 12 00 
gatgttgott Lcattggctc cataatgacc gtgtggctaL ccacagcccc aacagagccc 12 6 0 
ctgacccact ggtaccaggt ccggtgcctc ttccagtcac cgttgtttgc caaggccggg 13 2 0 
gacacgctct cagggacatg tctgottatt gccaacaaaa gacagagcta tgacatcagt 1380 
attgtggcac aggtggacca gacaggctcc aagtccagta acctgctgga tctaaagaac 144 0 
cccttcttca ggtacacagg tacaacccca Lcacccccac ctggctcaca ctacacgtct 1500 
ccctcggaga atatgtggaa cacaggaagc acctataaLc tcagcagcgg ggtggctgtg 156 0 
gcLgyaatgc ctactgccta cgacctgagc agtgttattg ccggcggctc cagtgtgggt 1620 
cacaacaacc tgattccctt agctaacaca gggattgtca atcacaccca ctcccggatg 1680 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/4 64 ^ 3 7 7 
Input Set : A:\7 26sl.txt 

Output Set: N:\CRF3\1207200O\I464377.raw 



DATH:: 12/07/2000 
TIME; 10:03:40 



59 gcjctccaLaa tgagcacggtj cattcjtccaa gyctoctcacj 

60 agctccagtg cccactntgc agtcaacaac cagttcacca 

61 atggcctcgc ccatgtccat cccgaccaac accatgcact 

62 gccgcgacag cactgcgcac tgacagcacc aggaaacoaa 

63 agccagtggc tgLtccccct tgttctggag aagttgttga 
6^1 tgctatggga acttygacaa ttttgtacac gatigtcgccg 
55 cccaaccttt ggtcccgagc gcgtgttgct gccatacttt 
66 agccctcatc ctgttctgta ctctccactc tqacctggct 

6 7 caagtcctcc cccaaccccc acayctgcac ctgaccaggc 

68 caccacagac Gtggcagcac ccaccccaca acccgtcctt 

69 tggcagcaca gccagctgga cctctccttc aactaccagg 

70 tgacatgctg ctttttttaa ttttattttt ttac<iaaaag 

71 ccctctgaga aacccggctg gcgcgccaag ccagcagccc 

7 2 tctaggtgag gggtggccct gtcaagcctt cagagtgggc 
73 gttcacctca aacttgaatg tacaaaccac ccagctgtcc 
7 4 ctgctactigt cctgtcctga gccctgaagg cccccctcca 
7 5 cccagagtgt gtcacccLgg gctactgggg cagacaagaa 
7 6 acacacaagg aa9gcgtcv':t ctcctgatag ctgacatagg 

7 7 tcatgttrrta cttaatcctc tea a ga cage aaccctggga 

78 cccagacaag aagaaaagca aacaaggaag ggbgattaat 

79 attcccttac cctagagtcc ccacctgaat ggccacagcc 
HO aaagcjctgga gctgctctgt gccaccctcc Lgacctgtca 
fil gcagctggga accaggctct ctcctgtcca tcagtaatac 

82 ccccacottt atataaattc tctggatoac ctLtgcatag 

83 gtaa 

8 5 <210> SFJQ ID NO: 2 
8 6 <211> LENGTH; 608 

8 7 <2A2> TYPE: PRT 

8 8 <213> ORGANISM 

90 <220> FEATURE: 

91 <22 3> OTHER INFORMATION: 

9 3 <4 00> SEQUENCE: 2 

94 Met Ala Ala Ala Ala Ala Thr Ala VaJ. Gly Pro 

95 1 5 10 

96 Gly Val Ala Gly Pro Gly Gly Ala Gly Pro Cys 

97 20 25 

9 8 Phe Pro Gly Ala Arg Leu Leu Thr lie Gly Asp 

99 35 40 

100 Gin Arg His Ala Glu Gin Gin Ala Leu Arg Leu 

101 50 5 5 

102 Pro Asp Ala Ala Gly He Ala Len Tyr Ser His 

103 65 70 75 

104 Phe Lys Cys Ser Val Ser Arg Glu Thr Glu Cys 

10 5 8 5 90 

106 Gin Ser Phe He He Thr Leu Gly Cys Asn Ser 

107 100 105 

108 Ala Thr Pro His Asp Phe Cys Ser Phe Tyr Asn 

109 1,15 120 

110 Arq Gly His Thr Leu Glu Arg Ser Val Phe Ser 



gtgcccaggg 
tgggtggccc 
atgggagtta 
a tea a g tec a 
acacccggtc 
ctgceoucaa 
a cat gag ate 
ttgaeatctg 
aggaggaggc 
ycacetcccc 
ccacatggtc 
aaccagtgte 
ctgttcctag 
fjcagccectc 
aaaggee tag 
tcaaaagctt 
acctcaaaga 
cctgtgtgtL 
aggagcctcg 
aagcacaggc 
tgccacagga 
gggaatcaca 
tccttgetcg 
aaaa taaaag 



aggcggcggt 
tgcca tctct 
gg tgccteca 
ggcccggeac 
acagcctcet 
gtacccccag 
ctgttggggc 
ctggaagagg 
cagcagctgc 
tcacctgggg 
accatgggcg 
aacccacaga 
gcccagaggt 
ccaecaaagg 
tccctacttt 
gaacaggeag 
tctgtcacac 
gcgttcaca t 
cagggacete 
ag tttcccct 
accce ttggc 
gggccc tcag 
gatggcectc 
tgtttgcttt 



Artificial Sequence 



Deduced amino acid sequence ol CARMl 



Gly Ala Gly Ser Ala 
15 

Ala Thr Val Ser Val 

30 

Ala Asn Gly Glu He 
45 

Glu Val Arg Ala Gly 
60 

Glu Asp Val Cys Val 
80 

ser Arg val Gly Arg 
95 

Val Leu He Gin Phe 
110 

He Leu Lys Thr Cys 
12 5 

Glu Arg Thr Glu Glu 



17 40 
1800 
18G0 
1920 
1980 
2040 
2100 
21 GO 
2220 
22B0 

23 4 0 
2400 

24 6 0 
2520 
2580 
264 0 
2700 
2760 
2820 
28 8 0 
294 0 
30 0 0 
3060 
3120 
3124 
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Input: Set : A:\7 26sl.txt 

Output Set: N:\CRF3\12072000\I464377.raw 
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RAW SEQUENCE LISTING DATE: .12/07/2000 

PATENT APP.LICATIOM: US/09/4 64, 3 7 7 TIME: 10:03:40 

input Sot : A:\726sl.txt 

Output Set: N:\CRF3\1207 2000\I4 64 377.raw 

160 Ser Ser Val Gly His Asa Asn Igu lie Pro Leu Ala Asn Thr CUy Tie 

161 530 535 540 

162 Val Asn His Thr His Ser Arg Met Gly Ser lie Met Ser Thr Gly lie 

16 3 54 5 5 50 555 56 0 

164 Val Gin Gly Ser Ser Gly Ala Gin Gly Gly Gly Gly Ser Ser Ser Ala 

165 565 570 575 

166 His Tyr Ala Val Asn Asn Gin Phe Thr Met Gly Gly pro Ala lie Ser 

167 580 535 590 

163 Met Ala Ser Pro Met Ser lie Pro Thr Asn Thr Met His Tyr Gly Ser 
169 595 600 605 

171 <210> SEO ID NO: 3 

172 <211> LENGTH: 608 

17 3 <212> TITPE: PHT 

174 <213> ORGANISM: Artili.cial Sequence 

176 <220> FEATURE: 

177 <223> OTHER INFORMATION: CARI41 VLD TO AAA Variant 

179 <4 00> SEQUENCE: 3 

180 Met Ala Ala Ala Ala Ala Thr Ala Val Gly Pro Gly Ala Gly Ser Ala 

181 1 5 10 15 

182 Gly Val Ala Gly Pro Gly Gly Ala Gly Pro Cys Ala Thr Val Ser val 

183 20 25 30 

184 Phe Pro Gly Ala Arg Leu Leu Thr Tie Gly Asp Ala Asn Gly Glu Tie 

185 35' 40 45 

186 Gin Ary His Ala Glu Gin Gin Ala Leu Arg Leu Glu Val Arri Ala Gly 

187 50 55 60 

1B8 Pro Asp Ala Ala Gly lie Ala Leu Tyr Ser His Glu Asp Val Cys Val 

189 65 7 0 75 80 

190 Phe Lys Cys Ser Val Ser Arg Glu Thr Glu Cys Ser Arg Val Gly Arg 

191 85 90 95 

192 Gin Ser Phe lie Tie Thr Leu Gly Cys Asn Ser Val Leu Tie Gin Phe 

193 100 105 110 

194 Ala Thr Pro His Asp Phe Cys Ser Phe Tyr Asn lie Leu Lys Thr c;ys 

195 115 120 125 

196 Arg Gly His Thr Leu Glu Arg Ser Val Phe Ser Glu Ax^g Thr Glu Glu 

197 130 135 140 

198 Ser Ser Ala Val Gin Tyr Phe Gin Phe Tyr Gly Tyr Leu Ser Gin Gin 

199 145 150 155 IGO 
2 00 Glu Asn Met Met Gin Asp Tyr val Arg Thr Gly Thr Tyr Gin Arg Ala 
201 165 170 175 

2 02 lie Leu Gin Asn His Thr Asp Phe Lys Asp Lys lie Ala Ala Ala Val 

203 180 185 190 

204 Gly Cys Gly Ser Gly lie Leu Ser Phe Phe Ala Ala Gin Ala Gly Ala 

205 195 200 205 

206 Arg Lys lie Tyr Ala val Glu Ala Ser Thr Met Ala Gin His Ala Glu 

207 210 215 220 

208 Val Leu Val Lys Ser Asn Asn Leu Thr Asp Arg lie Val Val He Pro 

209 225 230 235 240 

210 Gly Lys Val Glu Glu Val Ser Leu Pro Glu Gin Val Asp He Tie He 

211 245 250 255 
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RAW SEQUENCE LISTING 0AT}3 : .12/07/2000 

PATENT APPLICATION : US/09/4 6 4^377 TXIiE : 1.0 : 03 : 4 0 



Input: Set : A:\7 26sl.txt 

Output Set: N:\CRF3\120720QO\l464377.raw 
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Pro 


Pro 


Pro 


Gly 


Ser 


His 


Tyr 


Thr 


Ser 


Pro 


Ser 


Glu 


Asn 


241 










4 8 5 










4 90 










4 9 5 




242 


Met 


T rp 


Asn 


Thr 


Gly 


Ser 


Thi" 


Ty r 


Asn 


Leu 


Ser 


Se.r 


Gly 


Va.! 


A. La 


Va.L 


2 43 








500 










JU 9 










~» 1 U 






24 4 


Ala 


GJ.y 


Met 


Pro 


Thr 


Ala 


Tyr 


Asp 


Leu 


Ser 


Ser 


Val 


I le 


Ala 


Gly 


Gly 


24 5 






515 










520 










D ZD 








"^4 6 


Ser 


Ser 


Val 


Gly 


His 


Asn 


Asn 


Leu 


He 


Pro 


eu 


a 


Asn 


T h r 


G I V 


Tie 


24 7 




53 0 










535 










54 0 










24 8 


Val 


Asn 


His 


Thr 


His 


Ser 


Arg 


Met 


Gly 


Ser 


I le 


Met 


Ser 


Thr 


Gly 


He 


249 


545 










550 










555 










560 


250 


Val 


Gin 


Gly 


Ser 


ser 


Gly 


Ala 


Gin 


Gly 


Gly 


Gly 


Gly 


Ser 


Ser 


Ser 


Ala 


2 51 










565 










570 










57 5 




2 52 


His 


Tyr 


Ala 


Val 


Asn 


Asn 


G.ln 


Phe 


Thr 


Met 


Gly 


Gly 


Pro 


Ala 


Tie 


Ser 


253 








5B0 










58-5 










590 






2 54 


Me t 


Ala 


Ser 


Pro 


Met 


Ser 


lie 


Pro 


Thr 


Asn 


Thr 


Met 


Hi s 


Tyr 


Gly 


Ser 


2 55 






595 










600 










605 








2 57 


<210 


\> SEQ ID NO: 


4 
























258 


<211.> LENGTH 


[: 10 
























259 


<212> TYPE: 


PRT 


























260 


<213 


:> ORGANISM: 


Artif.ical Sequence 
















262 


<220 


i> FEATURE: 
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